Semantic Feature Representation to Capture News Impact
Abstract
This paper presents a study in which semantic frames are used to mine financial news in order to quantify the impact of news on the stock market. We represent news documents in a novel semantic tree structure and use tree kernel support vector machines to predict changes in stock price. We achieve efficient computation through linearization of tree kernels. In addition to two binary classification tasks, we rank news items by their probability of affecting a change in price, using two ranking methods that require vector space features. We evaluate our rankings based on receiver operating characteristic curves and analyze the predictive power of our semantic features. For both learning tasks, the proposed semantic features provide superior results.
Similar resources
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays a key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researchers to use...
Semantic Feature Representation to Capture News Impact
Mining natural language text for real world applications can benefit from a data representation that encodes semantic structure. Tree-based features, however, limit the available learning methods. This paper presents a study where semantic frames are used to mine financial news so as to quantify the impact of news on the stock market. We represent news documents in a novel semantic tree structu...
Graph Structured Semantic Representation and Learning for Financial News
This study links stock prices of publicly traded companies with online financial news to predict direction of stock price change. Previous work shows this to be an extremely challenging problem. We develop a very high-dimensional representation for news about companies that encodes lexical, syntactic and frame semantic information in graphs. Use of a graph kernel to efficiently compare subgraph...
Image Classification via Sparse Representation and Subspace Alignment
Image representation is a crucial problem in image processing, where there exist many low-level representations of images, e.g., SIFT, HOG and so on. But there is a missing link between low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principal component analysis are employed to d...
Effect of Historical Buildings Representation in Cyberspace in Creating Tourists’ Destination Image (Qualitative Study of Traditional Accommodations in Kashan)
Introduction: Understanding the representation components of the historical buildings in cyberspace and their impact on the mental image of the tourists is a significant fact in tourism recognition and management. A part of this subject is the impact of place representation on the destination image of the tourist. In this research, the destination is traditional accommodations that attract tour...
Publication date: 2014